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Formal concept analysis (FCA) is now widely recognized as a useful approach 
for extracting, representing, and analyzing knowledge in various domains. 
The high computational cost of knowledge processing and the difficulty of 
visualizing the lattice are two key challenges in practical FCA 
implementations. Moreover, assessing the finalized built-up lattice may be 


problematic due to the enormous number of formal concepts and the 

complexity of their connections. The challenge of constructing concept 
Keywords: lattices of adequate size and structure to convey high-importance context 
features remains a significant FCA aim. In the literature, various strategies for 
concept lattice reduction have been presented. In this work, we suggest a 
categorization of reduction methods for concept lattice based on three main 
categories: context pre-processing, non-essential distinctions elimination, and 
concept filtration, whereby using FCA-based analysis, the most important 
methods in the literature are analyzed and compared based on six pillars: the 
preliminary step of the reduction process, domain expert, changing the 
original data structure, final concept lattice, quality of reduction, and category 
of reduction method. 
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1. INTRODUCTION 

Formal concept analysis (FCA), is a practical conceptual model used to interpret, extract, and analyze 
knowledge. Wille's work in 1982 was instrumental in formalizing it [1]. He presented in his work the 
mathematical aspects of the FCA model. A key characteristic of the FCA is the fundamental incorporation of 
three main knowledge extraction components, namely, the exploration of data concepts (formal concepts in 
FCA), the exploration and interpretation of relationships in data (implications in FCA), and visualization of 
the information system in the form of a conceptual hierarchy (concept lattice). Each lattice element can be 
considered a formal concept. The extension component includes all objects sharing the same set of 
characteristics, while the intention part includes all attributes corresponding to the same set of objects [2]. 
Together, these make up a formal concept, a basic unit of concept that plays a vital role in knowledge 
processing. FCA has a wide range of applications in a variety of fields. Such as data mining [3], information 
retrieval [4], neural networks [5], [6], ontology engineering [7]-[9], reliability engineering [10], and social 
networks [11]-[16]. Make use of FCA modelling techniques in order to capture knowledge. A comprehensive 
survey of FCA applications in knowledge discovery and information science can be found in [17]. 
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The major factor in the success of FCA is the size of the final concept lattice that is build up when 
formal concepts are retrieved from their formal context in a sets of extents (set of objects) and set of intents 
(set of attributes). The usual characteristics of formal contexts are their size and complexity, as well as the 
presence of a great deal of information duplication. As a consequence of this, one of the primary problems that 
have been observed in practical implementations of FCA is the high computing cost of knowledge processing 
as well as the difficulty in representing the lattice. Moreover, assessing the finalized built-up lattice may be 
problematic due to the enormous number of formal concepts and the complexity of their connections. As a result, 
important traits will be lost in a tangle of useless information, especially the information that is meant to be 
tracked and disclosed. The problem of generating concept lattices of adequate size and structure to display 
highly important characteristics of the context remains a critical goal of FCA [18], [19]. Even small data might 
generate a substantial number of formal concepts [20], [21]. 

In this day and age of big data, FCA may also be utilized as a powerful tool for analyzing large dataset 
too. Therefore, while working with large datasets, it is essential to have fast and accurate FCA algorithms for 
knowledge discovery and knowledge representation. In recent years, several parallel and distributed 
approaches have been suggested to speed up the process of enumerating all possible concepts (relevant 
concepts). Parallel implementations utilizing the CloseByOne algorithm are proposed by the authors of the 
study [22]. The authors use the MapReduce programming model [23] to develop the first distributed algorithm 
[24], this was accomplished in 2009. Chunduri et al. [25] introduced a distributed FCA method that they termed 
the UNConceptGeneration to extract new knowledge from binary object-attribute relational data. This method 
was based on the fast concept analysis technique that was introduced by Lindig [26]. This paper presents both 
an analysis and an implementation of the FCA Upper Neighbor algorithm. This algorithm extracts the upper 
concepts from binary matrices by making use of the MapReduce architecture, which is a distributed method 
that is typically applied to large datasets. The proposed approach provides scalable, distributed algorithms to 
address the challenges of working with large datasets and generating formal concepts. As a result, they built 
the method utilizing the MapReduce framework, which provides a viable solution. In recent years, MapReduce, 
a novel model for distributed computing, has been increasingly popular due to the prevalence of its 
implementation utilizing Hadoop. Hadoop is built to operate over massive datasets, improving retrieval 
performance. The MapReduce architecture conceals the complexities of parallel processing, such as node 
availability and fault tolerance, which makes the implementation of distributed computing much more 
manageable. In a nutshell, the MapReduce process divides the computation of jobs into two distinct phases. In 
the first step of the process, map functions are used to delegate a computing task to various machines, each of 
which will process a distinct subset of the data. Due to the fact that it stores data using the hadoop distributed 
file system (HDFS), the throughput of MapReduce is particularly effective. In order to facilitate the processing 
of maps at the most local level possible, the data is replicated in blocks and distributed across multiple 
machines. The results of the Map function are referred to as intermediate outputs, and after being temporarily 
stored on the processing machine, they are combined by a function known as Reducer, which is the second 
phase of the MapReduce framework. 

Parallel algorithms, in general, have the drawback of demanding machinery that is outfitted with 
multiple processors or processor cores. This is a disadvantage for general use. In spite of the fact that the trend 
in the development of technology is toward multicore microprocessors, hardware configurations that contain a 
significant number of processing cores remain relatively expensive and uncommon. On the other hand, 
distributed algorithms can be executed on connected commodity hardware if necessary. In general, parallel 
algorithms have significantly lower overheads for managing computations compared to distributed algorithms. 
However, distributed algorithms are more efficient in terms of cost because they may be executed on regular 
personal computers linked together over a network. The reader can get more information about these kinds of 
algorithms here [22]-[25], [27], where in this paper, we will compare and analyse the traditional methods for 
attribute and concept lattice reduction proposed in the literature. 

Although big data methods speed up the calculation to generate concept sets, attribute reduction still 
has its importance in the FCA domain. Attribute reduction of a concept lattice facilitates the discovery and 
expression of hidden information in large datasets, providing a new approach for constructing a concept lattice 
and enhancing the theory of concept lattices, which is essential for theoretical study and practical application. 
The goal of attribute reduction is to identify the smallest number of attributes that can accurately capture the 
concept and structure of the initial formal context. Because the concept lattice of the reduced subset is 
isomorphic to that of the original, the reduced concept lattice may be used to compute the original lattice, 
reducing the complexity of the concept lattice of the original formal context. As a result, we get a smaller 
concept and knowledge graph, only the relevant concepts. This graph can be easier understood and interpreted 
by human users. 

Several methods for attribute and concept lattice reduction are proposed in the literature to tackle the 
problem of generating concept lattices of adequate size and structure to display highly important characteristics 
of the context, each method with its own set of properties. Some methods use a context-level representation to 
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start the process of reducing concept lattice complexity. On the other hand, for some methods using a lattice 
version to reduce complexity, we can perform such lattice level pruning like the iceberg reduction. Generally, 
some reduction methods try to determine the smallest number of objects\attributes that preserve the original 
lattice's structure in the reduction. This paper will put such methods under the pre-processing context category 
[28]-[31]. Other methods, which aim for a high level of simplicity that highlights the most significant features, 
will be categorized as non-essential distinctions elimination [32]-[34]. Alternatively, a method works by using 
a relevance criterion to choose formal concepts, objects, or attributes; these methods will be categorized as 
concept filtration [35], [36]. 

The three categories of concept lattice reduction methods stated above are discussed in further depth 
in this study, and the most important methods for each category are highlighted. The methods are analyzed 
using formal concept analysis, which is based on six major pillars: the preliminary step of the reduction process, 
domain expert, changing the original data structure, final concept lattice, quality of reduction, and category of 
reduction method. In addition to the FCA-based analysis, complexity of an algorithm, scalability, and reliability 
of the concept lattice are evaluated. We used a well-known FCA tool, ConExp [37], for drawing all the line 
diagrams in this study. 

This paper is divided into five sections. A brief review of FCA terminology and notions is offered in 
the section 2, along with a small demonstration example. Section 3 divides the methods for reducing concept 
lattice into three categories: context pre-processing, non-essential distinction elimination, and concept 
filtration. The analysis and comparison of FCA reduction methods are presented in section 4 based on six major 
pillars: the preliminary step of the reduction process, domain expert, changing the original data structure, final 
concept lattice, quality of reduction, and category of reduction method, by using FCA-based analysis. Section 5, 
the last part of this work, discusses future work and concludes this work. 


2. FCA: KEY TERMINOLOGY AND NOTIONS 

FCA is a mathematical field that dates to the early 1980s. Its key feature is the representation of 
information using particular diagrams known as line diagrams (Hasse diagrams), which mathematicians refer 
to as diagrams of concept lattices. To interpret our research, we will provide a brief overview that explains key 
concepts and definitions of FCA and an example that will be used throughout the study. The terminology and 
notions of FCA shown in this paper are based on [2]. 


Definition 1. Formal context: a formal context is denoted by the notation (G, M, /) is a cross table, where G is 
is a group of objects, M denotes a group of attributes, and ] © G X M denotes an occurrences connection that 
exists between G and M. For any object g € G and attribute m € M there is a binary relation gIlm ((g,m) € 
T) which denotes that g (object) has m (attribute). It is usual practice to use a "cross table" to depict a formal 
context. In a formal context, the presence or absence of a link between objects and their attributes is represented 
by crosses and blank spaces or (1s and Os). Such a context is known as a binary context, as shown in Table 1. 

In Table 1, the group of objects G are {0,,02,03,04,0;}, and the group of attributes M are 
{@1, Az, 23,4, As, ag}, if there is a 1’s in the intersection of the row for object g and the column for attribute 
m, then object g possesses attribute m, and if there is 0’s, then object g and attribute m are unrelated. For 
instance, in the Table 1, the attributes {a,,a,} belong to the object 0,. 


Table 1. Formal context 


Objects/attributes ay ao a ay as a 
01 1 1 0 0 0 0 
02 1 0 0 0 0 0 
03 1 1 1 1 0 1 
04 0 1 0 1 1 1 
05 1 0 0 0 0 0 


Definition 2. Derivation operators: for each subset of objects A & G and a subset of attributes B © M ina 
formal context (G, M,1), the derivation operators T = 2° +2” |=2™ + 2° are explicitly defined as A= 
{mE M|Vg €A:(g,m) € 1}, B' ={g € G| Vm € B:(g,m) € J}. This definition holds true for all formal 
contexts. The up operator A! is just the collection of all attributes that all objects from A have in common, and 
the down operator B is the collection of all objects that share all of B 's attributes. The derivation (1, J) 
operators A’ and BY are also known as A’ and B’. For instance, given the formal context depicted in Table 1. 
One can simply notice that: 
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{0,}" = {1,2} 
{0,,0,}' = {a} 

{a,}' = {01, 02,03, 05} 
{a1 Az} = {01,03} 


Definition 3. Formal concepts: is a pair (A, B), derived from a formal context (G, M, 1), such that, A © G which 
indicate the extent part for (A, B), and B © M is indicate the intention part of the formal concept (A, B), such 
that A’ = B, B’ = A. For instance, the pair ({03, 04}, {a2, a4, a6}) is a formal concept according to the formal 
context in Table 1, with {03, 04} being the extent part and {az, a4, a6} being the intent part. 


Definition 4. Concept lattices: given two formal concepts like c, = (A, By), C2 = (Az, Bz) of a context can be 
sorted using the subconcept-super concept sorting relation <, which is described as follows: (A,,B,) < 
(Az, B,) = A, © A, (or equivalently B, © B, ), where c, is a subconcept (more specific) and czis a super 
concept (more general). In a formal context K=(G, M,/), the set of all formal concepts along with the partial 
order < usually defines a complete lattice known as a concept lattice [1], denoted by B(G, M, 1). 

According to the first section of the "concept lattice fundamental theorem" given in [1], a concept 
lattice B(G, M,/) is a “complete lattice” in which the infimum and supremum exist for any random set C © 
B(G,M,1), and given by (Aj, B,) A (Az, Bz) = (A, N Az, (By U Bz)’") and (Ay, By) V (Az, Bz) = ( (Ay U 
Az)", By N Ba). 

The line diagram for the concept lattice is shown in Figure | and it is created using the formal context 
of Table 1, where Figure 1(a), indicates the set of formal concepts created from the formal context and the 
relationship between them, known as the subconcept-superconcept, could make up the concept lattice [2]. The 
line diagram's nodes represent the formal concepts. The terms of object concept ({g}",{g}') and attribute 
concept ({m}’, {m}"") may also be used to explain the labeling of the formal concepts in the line diagram, where 
the object concept is denoted by y(g) and the attribute concept is denoted by u(m). The tagging (labeling) of 
B(G, M, 1) is therefore accomplished in the following manner: the formal concept y(g) is assigned the tag g 
for every object g, and the formal concept u(m) is assigned the tag m for every attribute m. Some concept 
nodes have objects drawn underneath them, while others have attributes drawn above them (usually, concept 
object labeling is drawn below a node in a line diagram while the concept attribute labeling is drawn above the 
node). Not every concept in a context is an object or an attribute concept. Any concept can be an object concept, 
an attribute concept, a combination of the two, or none [38]. From the concept lattice, the extent part (concept 
objects) for a formal concept can be reached by a downward path from the node to capture all concept objects. 
In the highlighted node in Figure 1(b), we can easily notice that { 01, 03} is the extent part of the node, whereas 
the intent part (concept attributes) of the highlighted node is {a,, a2} is obtainable by following the upwards 
path for the highlighted node to capture the concept attributes. 


Figure 1. Concept lattice structure (a) concept lattice of Table | and (b) clarify the path to reach the concept 
objects and concept attributes 


Definition 5. Attribute Implications: an attribute implication is constructed across the set M of attributes. The 
following is the interpretation of the implication of the form (A > B) with (A,B & M);: if an object has all the 
attributes of the set A, it also has all of the attributes of the set B, and in the provided context K = (G, M,/) is 
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the same holds if (A! € B*). Then it is respected by all concept intents. The following are the definitions of the 
support and confidence measures for any implication (A > B): sup(A > B) = |(AUB)'|/|G|, conf(A > 
B) =|(A UB)'|/|A'|. The set J of implications shown in Table 2 are constructed fron the concept lattice of 
Figure 1(a). For instance, the implication a,,a, — a3 denotes those objects with the attributes a, and a, 
(really, just the objecto3) also have the attribute a3. 

The duquenne-guigues (DG) and Luxenberger bases of implications are often used to define the 
association rules in any concept lattice. We make use of the DG implications here. The percentage of confidence 
levels is what distinguishes these two bases. The DG basis has a confidence level of 100%, but the Luxenberger 
basis has a confidence level of less than 100% [39]. The terms “finer”, “consistent set’, “reduct”’, and “core” are 
defined in the following definitions. These concepts are required for reducing the subset of attributes in FCA 
effectively. Attribute reduction aims to reduce attributes while maintaining the number of concepts and their 
relationships, resulting in a new concept lattice that is isomorphic to the original [40], [41]. 


Table 2. Attribute implications derived from a formal context of Table 1 


S.No Implications 
1 a3 > ay 
2, a, > Ay 
3 a; > a, 
4 a3 > a, 
2) Ag > az 
6 a, > ay 
7 a; a, 
8 ag > ay 
9 ay Ag 
10 a, > Ag 
11 a; > Ag 
12 a1,A, > a3 
13 a1,Ag > Az 


Definition 6. For a given two concept lattices L,(B, A,,1,) and L2(B, Az, Iz). If for any (X, YE L2(B, Az, Iz) 
there exists (X', Y’) € L,(B, A;,1,) such that X’ = X, then we said that L,(B, A,, /,) is finer than L2(B, Az, I2), 
denoted by: 


L,(B, Ay, I) Ss Ly (B, Az, In). 


If L,(B, Ay,,) S L2(B, Az, 12) and L2(B, Az, In) < L,(B,A;,1,), If this is the case, we can say that the two 
concept lattices are isomorphic to one another, as indicated by the following: 


L,(B,Ay,1,) = L2(B, Az, 12). 


Let (G, M,1) be a formal context. For any B © M and Ip = 1 (G X B), (G,B, I) is also a formal context, 
which can be interpreted as a subcontext of the original one. However, we can apply the mapping 7 and J , 
which will be rewritten in this subcontext as T? and J? . For any A & G, we obtain that A'? = A' n B, which 
implies that A‘ '? = AN’ q B = A'N D = A". Note that when we consider a subset X & B, then X'¥ = X!. 
As a result, Albits = AlbistB = A’B. 

Let (G, M, 1) be a formal context, for any B © M such that B # @, L,(G,M,I) < L,(G, B, Ig) holds. 


Definition 7. For a given formal context (G,M,J), for a set of attributes B © M such that L(G, B,I,) = 
L,(G,M,I), then B is called a consistent set of (G,M,/). Furthermore, if L, (G, B- {b}, I5_-(}) x 
L,(G,M,J) for all b € B, then B is called a reduct of (G, M,1). The intersection of all reducts of (G,M,1) is 
called the core of (G, M,/). 

The subsequent result is straightforward to validate. Let (G, M,/) be a formal context, for any B © M 
such that B # @, then B is acconsistent = L,(G,B,I,) < L,(G,M,I). Antoni et al. [31], established three 
distinct kinds of attributes in a formal context, which are analogous to the rough set theory described in [42]. 


Definition 8. For a formal context (G, M, J), the set {B;| B; is areduct,i € T}, where T is an index set that 
indicates all reducts of (G, M,/). Then M is divides into three parta as follows: 

a. Absolute necessary attribute (core attribute) b:b € Nicer B;. 

b. Relative necessary attribute c:c € Ujer Bi — Nier Bi- 
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c. Absolute unnecessary attribute d:d € M — Ujer Bj. 

We use the term "unnecessary attribute" to describe a feature that is not necessary. e:e € M — (\jer Bi, 
and can be classified as either "relatively necessary" or "absolutely unnecessary". Assuming that b, c, and d are 
absolute necessary, relative necessary, and absolute unnecessary, respectively, it is obvious that b’ # c',c' # 
d',b' # da’. Let's say that (G, M,1) represents a formal context. Evidently, the following outcomes have taken 
place [40]. 

Corollary 1. The core is a reduct & there is only one reduction. 
Corollary 2.m € Mis an unnecessary attribute <= M — {m} is a consistent set. 
Corollary 3.m € Misa core attribute — M — {m} is not a consistent set. 

Considering the formal context depicted in Table 1, it contains two reduces: B, = {a,, Az, a4,as}, 
By = {a,,Qz,das5,a¢}. As a result, attributes a,,a, and a, are absolutely necessary, a, and d¢ are relative 
necessary attributes, and a3 is the unnecessary attribute. 


3. CATEGORIZATION OF FCA REDUCTION METHODS 

For FCA real-world applications, one essential aspect is the visualization of formal concepts in 
hierarchical order within the concept lattice structure. The magnitude of the concept lattice, which is created 
from a very vast formal context, is one of the most significant challenges associated with this methodology. A 
large formal context results in a concept lattice that is difficult to work with in practice. As a result, problems 
with FCA applications, such as dealing with a large formal context and reducing the size of the concept lattice, 
are highlighted as critical issues in FCA applications [38]. 

Attribute reduction of a concept lattice facilitates the discovery and expression of hidden information 
in large datasets, providing a new approach for constructing a concept lattice and enhancing the theory of 
concept lattices, which is important for theoretical study and practical application. The goal of attribute 
reduction is to identify the smallest number of attributes that can accurately capture the concept and structure 
of the initial formal context. Because the concept lattice of the reduced subset is isomorphic to that of the 
original, the reduced concept lattice may be used to compute the original lattice, reducing the complexity of 
the concept lattice of the original formal context. The literature describes various techniques of attribute 
reduction to control the complexity and size of formal contexts, formal concepts, concept lattices, and 
implications. Popular reduction techniques for improving FCA scalability include iceberg concept lattices [35], 
matrix decompositions [43], [44], conceptual scaling for many-valued contexts [45], the reduction of the 
concept lattices based on rough set theory [46]-[48], and others. 

Priss and Old [49] proposed a kind of categorization of the concept lattice reduction methods called 
“data weeding techniques”, where they categorized the existing reduction methods into four categories. Visual 
reduction methods are the first category. These methods aim at how data is shown without altering the 
mathematical structure of the fundamental concept lattices. Faceting and plain scaling methods are the second 
category; these methods divide the original concept lattices into smaller ones without losing information. The 
divide should make sense in terms of the lattice’s content. Pruning and restricting methods are the third 
category, which includes removing objects, attributes, or concepts from concept lattices using statistical 
methods. The fourth category, decomposition and general scaling methods, decomposes a lattice while also 
reducing complexity, for instance, by aggregating objects or attributes. Their study looked at several "data 
weeding" methods that may be used to shrink a concept lattice and generate "good" graphical representations 
for concept lattices. Techniques for data weeding tend to be very dependent on the type of application. Unlike 
their work, Dias and Vieira [50] classify concept lattice reduction strategies into three groups. In the first group 
of reduction strategies, the context is stripped of superfluous information. The second group of reduction 
approaches simplifies formal context and concept lattices. The third group of reduction strategies is the 
selection of formal concepts, objects, and attributes. Based on their comparison methodology, we will 
categorize the most significant works and recent contributions in the direction of attribute reduction of a 
concept lattice. We will identify three categories of reduction methods for concept lattices. In the first category, 
“context pre-processing reduction,” the methods under this category try to determine the smallest number of 
objects\ attributes that preserve the original concept lattice's structure. In the second category, “non-essential 
distinctions elimination,” these reduction methods aim for a high level of simplicity that highlights the most 
significant features. In the third category, “concept filtration,” such a method works by using a relevance 
criterion to choose formal concepts, objects, or attributes. 

The three categories of reduction methods for concept lattices stated above are discussed in further 
depth in this comprehensive overview, mostly with important primary methods for each category highlighted. 
The methods are analyzed using formal concept analysis, which is based on six major pillars: the preliminary 
step of the reduction process, domain expert, changing the original data structure, final concept lattice, quality 
of reduction, and category of reduction method. Each pillar has its own set of features. In addition to the FCA- 
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based analysis, complexity of an algorithm, scalability, and reliability of the concept lattice are evaluated. It is 
essential to say that the focus of this study will be on classical FCA. The three categories of concept lattice 
reduction methods are defined and discussed in the following subsections, along with each category's most 
important reduction methods. 


3.1. Context pre-processing 

These methods focus on finding a formal context with the fewest possible objects or attributes while 
preserving the concept lattice's structure intact by working on eliminating or transforming redundant 
information in the formal context. It is possible to assert that g € G (set of objects), m € M(set of attributes) 
are redundant if eliminated or altered one of them or all in such a manner leads to leads to generating a concept 
lattice preserves the initial concept lattice’s structure intact. 

A first feasible reduction is to replace a group of objects with precisely identical attributes with a 
single object or a group of attributes that appear in identical objects with a single attribute. A clarified formal 
context has been acquired after such redundancies have been removed. As an illustration, Table 1 of the formal 
context shows that the objects demonstrate that the objects 0,,0, € G such that 0; = oz, then 0,,0,can be 
reduced to a single representative object. Similarly, the attributes a,,a, € M such that a, = ag, then a4, a¢ 
can be reduced to a single representative attribute [2]. 

Removing attributes that a set of other attributes may represent is another type of reduction that does 
not alter the concept lattice’s structure, termed a reducible attribute [2]. More formally, if an attribute m € M, 
and a set of attributes B © M, where m ¢ B, such that m’ = B’, then ym (attribute concept) can be considered 
as the infimum of ub (attribute concepts) where b € B. As a consequence of this, if the attribute m is removed, 
the lattice that is derived from the concluding formal context is equivalent to the initial lattice in terms of both 
its form and its relations. For instance, in the formal context depicted in Table 1, we can notice that a3 = 
{a,,a,}’ such that waz is the infimum of a, and a,. As a result, removing the attribute a, from that formal 
context yields a final lattice (new one) that is similar in form and relations to the initial lattice; the equivalent 
concept lattice structure with reduced tagging is that of Figure 1(a) without concept attribute tagged a3. 
Likewise, eliminating the reducible objects from a formal context can also result in a reduced formal context 
that can be used to derive a concept lattice that is similar in form and relations “isomorphic” to the initial lattice. 
More formally, an object g € G such that y(g) (object concept) is the infimum of y(a) (object concepts) such 
thata € A, where A € Gand g € A, by removing such objects result in a concept lattice that is equivalent to 
the initial lattice in terms of both its form and its relations (isomorphic) [2]. 

In order to minimize the size of the formal contexts while preserving the integrity of the concept lattice 
structure, numerous approaches have been developed. Zhang ef al. [51], the authors suggested the use of a 
"discernibility matrix" in order to construct a minimal set of characteristics. This was accomplished by 
considering (G,M,1I) as a formal context and two formal concepts (A;, Bj), (Az, Bz) € B(G,M,I). The 
symmetric difference between the intention parts B,, Bz determines the discernibility between the concepts 
(A;, B,), (Az, Bz) as follows: Dis (4,\p,),(A>,B,) = (Bi U Bz)\(B1 M Bz). After establishing discernibility matrix 
(Dis) from a given formal context (G, M,/), a minimal set of attributes B © M may be identified that resulting 
in a lattice B(G, M,1') isomorphic to the original B(G, M,I), where I' = I N (G x A) and A is the minimal set 
of attributes with the smallest cardinality. Qi et al. [52] the authors set guidelines for reducing the number of 
discernibility computations while maintaining the potential for getting a minimal set of attributes. Zhang et al. 
[51] determine if a formal context's attributes are “absolutely necessary”, “relatively necessary”, or “absolutely 
unnecessary”. To be "absolutely necessary," an attribute must be present in every minimal set. It is "relatively 
necessary" if it is included in at least one minimum set but not all of them. Lastly, an attribute is " absolutely 
unnecessary " if it is not present in any minimal set. 

Medina [29] looked at attribute reduction in three frameworks: “object-oriented concept lattices”, 
“property-oriented concept lattices”, and “concept lattices”. Regardless of the framework, it has been 
discovered that the attributes may be divided into three degrees of requirement, with the attribute reducts being 
similar at each level. Wang and Zhang [53], the authors describe the process of reducing formal contexts by 
eliminating attributes. Wang ef al. [54] offers a heuristic method for identifying the smallest possible collection 
of attributes. To deal with approximation sets, the authors in their work [54] expanded the work of [51]. 

Belohlavek [55], the author presented a method for factorizing concept lattices according to concept 
similarity. It has also been shown how to efficiently compute similarity relations. They developed and 
investigated the relationships between similarity at three levels: the similarity of concept lattices, the similarity 
of the set of objects\attributes, and finally, the similarity of formal concepts. The granular structure of concept 
lattices is investigated in [56] and how it might be utilised to decrease knowledge in FCA. Both attribute 
reduction and size reduction in concept lattices were discussed in the of [57], which was an important 
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contribution to FCA research. The authors offer a method for concurrently shrinking both attribute sizes and 
concept lattice sizes by employing an irreducible-cut concept lattice. 


3.2. Non-essential distinctions elimination 

These methods aim for a high level of simplicity that highlights the most significant features by 
extracting non-essential distinctions (based on some criterion) between formal concepts, sets of objects, or 
attributes from a formal context or a concept lattice. When applied to the information retrieval process [58], 
object clustering's primary objective is to lessen the number of dimensions that concept lattices contain. In this 
work, object equivalence groups are constructed by the application of the singular value decomposition (SVD) 
method. “SVD” is one of numerous linear algebra matrix decomposition techniques for reducing a large matrix 
to a smaller one. The authors construct an equivalency relation using reduced matrices created by the SVD 
approach, as described in the following: let h,, hz be two objects denoting documents that are equivalent if and 
only if the cosine of the angle between them reaches a specified threshold. To decrease the formal context, 
non-negative matrix factorization was applied [59]. Recently, Sumangali and Kumar [60] have developed a 
unique strategy that decomposes the original context in terms of dimensionally reduced low-rank matrices 
comprising real columns and rows using the CUR matrix decomposition technique. As a result, using CUR 
decomposition in FCA reduction techniques might help us extract the most significant data from the datasets. 

The authors used fuzzy k means (FKM) clustering to make the concept lattices more manageable. 
Using equivalence relations developed from FKM clustering, the context matrix is shrunk, and quotient lattices 
are generated. Each variable represents a range of membership levels; a given record may have many cluster 
memberships [21]. In their work, Cheung and Vogel [58] introduce a new technique to reduce the formal 
context significantly. They accomplish this by identifying a new object g derived by the intersection of all 
objects within a parity group whose attributes match the union of the attributes of the elements |g]. For instance, 
if J is the incidence relation in the initial formal context, then g'={olm for each o € |g|} in the reduced formal 
context. Considering the formal context in Table 1, Assume that all equivalence classes have a size of one, 
with the exception of a class containing objects 0, and 03. In this instance, objects 0, and 03 are replaced by a 
new object g, which shares the attributes of both 0, and 03: g’'=0; U 03={@,, Az,a3,@4,a¢}, while the 
remaining objects are unaffected. 

To facilitate the reduction process, domain experts (prior knowledge) about the issue area can be 
integrated into the reduction method. This is done in several studies like [18], [20], [61]. Junction based on 
object similarity (JBOS) uses expert knowledge of a domain (prior knowledge) to replace similar objects with 
representative elements that are similar to a certain degree [32]. To assess the similarity of objects, they used 
weight assignment (0 < w™ < 1) to every attribute m € M, where w™ is used to measure the relevancy of an 
attribute in the range of 0 (no relevance) to | (high relevance) and should be determined by an expert of the 
domain. The degree of similarity between objects like g,, gz is stated by a range from 0 (absolutely dissimilar) 
to | (absolutely similar), defined as follows: 


sim(g1, 92) = Xmem H(91, 92,M)/XimemW™, 


w™ se(g1,m) EI © (gz2,m) E€ 1 
0 otherwise. 


U(Gu Gom)=| (1) 


where g;, Jz are similar if their similarity is greater than the threshold. 

JBOS employs the intersection of the attributes of the objects that are regarded as similar, in addition 
to using prior knowledge. This feature prohibits objects from being created outside of the formal context. The 
research presented in [18] showed that it is feasible to drastically diminish a formal context by studying the 
implications given by the JBOS technique while also achieving sufficient performance on a particular work. 
By eliminating incidents from a formal context, the authors managed the complexity of a concept lattice [62]. 


3.3. Concept filtration 

The methods under this category work by using a relevance criterion to choose formal concepts, 
objects, or attributes. In many circumstances, more knowledge about the sets of objects and the sets of attributes 
is available. This knowledge is used by certain filtration methods to direct the reduction process. Some of these 
methods make use of attribute weighting [20], [62]. Wu ef al. [56] suggested using all of the user's prior 
knowledge to set limits on attributes. Only formal concepts that fulfill the limitations are maintained when 
constructing the concept lattice and formal concepts. [20] present an approach that uses a weight assignment 
to every attribute to convey its importance and then chooses formal concepts that are judged important by using 
the same method of [32] the authors in their work attempting to capture the significance of concepts through 
information conveyed by weights. Zhang e/ al. [61], the authors gave weights to attributes to associate 
significance with formal concepts. The importance of formal concepts is quantified similarly to the authors’ 
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work here [20]. However, the previous authors solve concept lattice completeness difficulties by establishing 
virtual formal concepts. 

Object, attribute, and concept selection procedures that employ a relevance criterion are types of 
concept filtration methods. Pasquier et al. [63] contributed significantly by connecting frequent items and 
formal concepts. The terms "support" and "frequent sets" are described as: Let B € M, where M is a set of 
attributes and Sup(B, G), is the count of objcts in G that contain all the attributes of B. We can say that a set 
of attributes B € M is frequent iff Sup(B, G) = minSup(minimal support previously set). Iceberg concept 
lattices are concept lattices constructed by selecting only frequent item sets. In this particular instance, the 
generated lattice is partial; just the formal concepts that occur most frequently were employed. i.e., given two 
formal concepts (A;, B,), (Az, Bz), where (A,,B,) < (Az, Bz), sup (B,, G) < sup (Bz, G), where G is the sets 
of objects. Stumme ef al. [35], the authors present the "Titanic algorithm" for the creation of “iceberg concept 
lattices” and illustrate the functionality of these lattices in a variety of applications, like mining association 
tules, the visualization of implications as well as the analysis of “large-scale” databases 

Soldano et al. [64], the authors presented a new model of concept lattice that can be considered of as 
being similar to iceberg concept lattice. The concept lattice that was produced as a result was given the name 
"alpha concept lattice" by the authors. An unlimited lattice that only contains frequent formal concepts can 
form the basis of what is known as an iceberg concept lattice. The frequency of specific formal concept 
elements is used by several strategies to make their selections. For instance, a formal concept can be chosen if 
it meets a limitation a@(user parameter). These limitations, like |A] > @,|B| > @ or (|A| x |B|) > aandso on, 
in the area of data mining, such limitations are considered in [65]. In concept lattice building and implication 
extraction, Belohlavek and Vychodil [66] developed a set of restrictions that could be directly exposed to 
derivation operators and evaluated their use. The authors put out a similarity metric for fuzzy formal concepts. 
A subset of formal concepts related to one another is chosen using the similarity measure; this subset may be 
much less than the initial set of formal concepts. Below is the definition of a metric used to compare formal 
concept extensions. 


Definition 9. Given two formal concepts (A;, B,), (Az, Bz), the similarity between the extensions A, and A, 
given by Sim(A,, Az)=1- |A, N A5|/ JA, U AI. 

This method is also discussed in [67], [68]. When minimizing the number of formal concepts, the 
quality of the formal context is also important to consider. Many researchers have shown how difficult it is to 
construct reliable concept lattices when noisy underlying data is. Several concept filtration methods are 
considered for this [69]-[72]. 

The term “stability” is associated with formal concepts, so judgment based on stability is a regularly 
employed strategy. Stability tries to construct an indicator for concepts that illustrates how the intention of the 
concept varies depending on the collection of objects. All formal concepts with a stability index less than a 
certain threshold are eliminated [69]. Due to the necessity to construct all subsets of each formal concept, the 
computation of this indicator is #P-complete. Kuznetsov et al. [70] proposed two heuristics to get around this 
difficulty. Markov chains were offered to calculate the stability index [73]. Klimushkin el al. [72] addressed 
how probabilistic factors are utilized in the probability-based method to choose specific formal concepts. They 
referred to this as "the selection of specific concepts". The factor is defined according to the likelihood that a 
specific object possesses a particular property. 


4. ANALYSIS AND COMPARISON OF REDUCTION METHODS 

In this section, we will analyze and compare the reduction methods that previously described each 
one with its category. The methods will be analyzed using formal concept analysis, which is based on six major 
pillars: the preliminary step of the reduction process, domain expert, changing the original data structure, the 
final concept lattice, quality of reduction, and category of reduction method. In addition to the FCA-based 
analysis, the complexity of an algorithm, scalability, and reliability of the concept lattice are evaluated. We 
used a well-known FCA tool, ConExp [37], for drawing all the line diagrams in this study. 
a) The preliminary step of the reduction process 

In this pillar, the focus is on the specific stage of the concept formation process at which the reduction 
methods being analyzed can be applied. Some methods [74] may be appropriate for use on the formal context. 
Other methods [75] may be more suitable for use on the set of formal concepts itself, which is a complete and 
irreducible set of concepts that can be derived from the formal context. Still other methods [76] may be more 
appropriate for use on the concept lattice, which is a graphical representation of the relationships between the 
concepts in the set of formal concepts. 
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b) Domain expert 

In this pillar, the focus is on whether or not the reduction methods being analyzed incorporate prior 
knowledge from domain experts in the reduction process. Some methods [32] incorporate this knowledge, 
while others [77] do not. This distinction is important because the use of prior knowledge from domain experts 
can potentially improve the quality of the reduction process and the resulting concept lattice. On the other hand, 
not using prior knowledge may result in a more objective and unbiased reduction process but may also 
potentially compromise the quality of the results. 

c) Changing the original data structure 

In this pillar, the focus is on reduction methods that involve modifying the set of objects, attributes, 
or occurrences (incidence relation) in the formal context or concept lattice. Some methods [32] may modify 
the set of objects, which are the entities being described in the formal context. Other methods [78] may modify 
the set of attributes, which are the properties or characteristics being used to describe the objects. Still other 
methods [61] may modify the occurrences or incidence relation, which is the relationship between the objects 
and attributes in the formal context. These modifications can have a significant impact on the resulting concept 
lattice and may be used to achieve specific goals in the reduction process. It is important to carefully consider 
the potential consequences of making such modifications before using these methods. 

d) Final concept lattice 

In this pillar, the focus is on the characteristics of the final concept lattice resulting from the reduction 
process, as compared to the original concept lattice. Some methods [29] result in a final concept lattice that is 
isomorphic to the original, meaning that it is structurally identical and preserves the relationships between the 
concepts. Other methods [61] result in a final concept lattice that is partial to the original, meaning that it is a 
subset of the original concept lattice and contains only a portion of the concepts and relationships. Still other 
methods [32] result in a final concept lattice that is different from the original, meaning that it is structurally 
distinct and does not preserve the relationships between the concepts in the same way as the original. 

e) Quality of reduction 

During the process of reduction, it is possible that some of the information that is being represented 
by a concept lattice will be lost. It is important that such a loss be quantified in an objective manner using 
indicators that reflect the ability of the concept lattice and that enable comparisons to be made between the 
various techniques. A concept lattice's complexity is typically evaluated using a variety of different metrics, 
including the cardinality of the covering relation, the number of formal concepts, objects, or occurrences, and 
the attribute cardinality. Other reduction methods utilize particular metrics for selecting formal concepts in the 
original lattice that represent the quality of the final lattice from the standpoint of the specific methodology. 
Indexes depending on stability [69], frequency [35], or distance [79] are only a few examples. Such metrics 
may not be applicable throughout all situations. Furthermore, such metrics exclude losses linked to structural 
characteristics of the lattice, which can be significant. 

Dias and Vieira [32] suggested indexes related to the usage of implications as a different perspective 
on the knowledge provided by the lattice. The objective is to compare the efficiency of such lattices using the 
sets of implications that correspond to the original and reduced formal contexts. The adjustments made to 
shrink the concept lattice are intended to be mirrored in the set of implications, decreasing the capacity of the 
implications to characterize a collection of reference objects. Two measures are provided to quantify the degree 
of comparability between both the original formal context and the reduced formal context: fidelity and 
descriptive loss. Let G, represent the set of objects from the original formal context, G, represent the set of 
objects from the reduced formal context, and 7, 79,..., 7% indicate the set of rules derived from the reduced 
concept lattice. 

The fidelity (F) metric is used to find out the percentage of successful rule applications to original 
objects. If there is a rule 7, = P > Q for which P C g' and Q ¢ g’ for an object g € Go, the rule 7; is said to 
have failed. The following equation calculates fidelity: 


F = (ZEa(1 — )/k) x 100, (2) 
where, Nf; denotes the number of failures of the rule 7;(i.e., the number of objects in G, for which the rule 
fails:). As a result, F is the percentage of non-failures when considering the |G,| x k applications of k rules 
applied to |G, |objects. To avoid numerous contributions to F from objects with precisely the same attributes, 
such objects must be reduced to just one object first. 

The descriptive loss (DL) is a metric that quantifies the reduction in the capacity to characterize a set 
of G, objects as a result of the removal of attributes during the reduction process. In order to define it, an “onto 
function” denoted by v: G, > G,r will be utilized. This function will map each object g in G, onto an object 
v(g) in G,. This mapping occurs as a direct result of the reduction process, which may have resulted in g being 
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simplified. It is presumed that the object (g) will have its attributes comprised of a selection from the set of g 
attributes, i.e., According to the definition provided in [32], the descriptive loss measure can be written as: 


1w(g)'| 
> oO 7 
DL= ( = a) x 100 (3) 


IGol 


where |v(g)'| is the reduced object's attribute count v(g) and |g’| is the original object's attribute count g. 
When attributes that are regarded "redundant" in the real application are removed, DL might show descriptive 
losses. As a result, such attributes must be eliminated before the reduction technique may be used. 

It is important to note that the level of fidelity is decided by a collection of implications that are 
obtained from the concept lattice. This collection of implications indicates the degree to which the information 
that is available in the generated formal context is consistent with the original. The descriptive loss is evaluated 
directly on the acquired formal context, and it evaluates the loss of capacity to represent the features that were 
provided in the framework that was given in the initial formal context [32]. Fidelity is a valuable metric for 
assessing non-essential distinctions elimination methods and concept filtration methods. Descriptive loss is 
beneficial for all types of reduction methods, but it requires formal context accessibility. This pillar considers 
several quality indexes of the final lattice, such as fidelity and descriptive loss, as stated above. 

f) Category of the reduction method 

Eventually, the method's reduction category is outlined in the last pillar. Three categories comprise 
all methods: context pre-processing methods, non-essential distinctions elimination methods, and concept 
filtration methods, which have been covered in section 3. 

Based on the six pillars described above, we will analyze and compare the most important reduction 
methods in the literature. The formal context (binary table) shown in Table 3 highlights the reduction methods, 
where objects denote the method's reference and attributes indicate a method's features. Figure 2, depicts the 
conceptual lattice (Hasse diagram) derived from the formal context shown in Table 3. At first glance, we can 
notice that all reduction methods modify the occurrence (actually, attribute p3c). 
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Figure 2. Concept lattice of reduction methods extracted from the formal context in Table 3 
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Table 3. Formal context (binary table) represents the reduction methods 
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The preliminary step of the reduction process (Pilarl): pla: formal context, p1b: formal concepts, plc: concept lattice. 

Domain expert (Pilar2): p2a: used), p2b: unused. 

Changing the original data structure (Pilar3): p3a: set og objects, p3b: set of attributes, p3c: occurrences(incidences). 

Final concept lattice (Pilar4): p4a: isomorphic to the original lattice, p4b: partial from the original lattice, p4c: different from the original 
lattice. 

Quality of reduction (Pilar5): p5a: fidelity losses, p5b: descriptive losses. 

Category of reduction method (Pilar6): p6a: Context pre-processing, p6b: Non-essential distinctions elimination, p6c: Concept filtration. 


Deleting characteristics (attributes)\objects modifies the set of occurrences (set of incidences) in the 
category of context pre-processing methods. Likewise, by grouping "similar" objects, "non-essential 
distinctions elimination reduction methods” diminish the occurrence relation (set of incidences). While 
choosing a set of formal concepts, a set of objects, or a set of attributes, concept filtration reduction methods 
modify the set of occurrences (set of incidences) by excluding any of those items. The line diagram also clearly 
shows that all reduction methods exhibit some descriptive losses (attribute p5b). 

A. Pillar 1 (The preliminary step of the reduction process) 

Considering the preprocessing steps for attribute or concept set reduction, one of the key elements is 
the selection of the object set representation. Based on the literature review, the relationships among the 
dominating approaches can be presented in the form of a concept lattice as shown in Figure 3. The performed 
analysis on the reduction methods in the literature highlights the following key approaches for preparing the 
appropriate object set. 

— One important aspect is the format of the data sets, namely it can be a) a context, b) a concept set, or c) a 
concept lattice. It can be seen that the different object set alternatives provide different ways and different 
options for the later reduction steps. For example, using a context-level representation, the attribute-level 
operations are more flexible and efficient than a lattice-based approach. On the other hand, using a lattice 
version, we can perform such lattice-level pruning like the iceberg reduction. The most dominating 
approach is the application of concept set representation; we can mention here, among others, the works 
[29], [51]-[54], [70], [77]-[80]. Regarding the exceptions, we can mention the method in [2], which has 
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formal context as a point to start the reduction process and the method in [81], which has both formal 
concepts and concept lattice as a point to begin the reduction process. Under the non-essential distinctions 
elimination category, all of those [32], [33], [43], [59], [60], [82]-[85] reduction methods have formal 
context as a preliminary step for the reduction process, except for the method in [86], which has both 
formal context and formal concepts as a point to start the reduction process and the method in [76], which 
has both formal concepts and concept lattice as a preliminary step for the reduction process. In the concept 
filtration category, eight methods [65], [71], [79], [82], [87]—[90] use formal context as a preliminary step 
for the reduction process, four methods [35], [64], [91], [92] use concept lattice as a preliminary step for 
the reduction process, three reduction methods [61], [69], [73] use both formal concepts and concept 
lattice as a preliminary step for the reduction process, and two reduction methods [20], [36] use both 
formal context and formal concepts as a preliminary step for the reduction process. 
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Figure 3. Concept lattice of Pillar 1. 


— Besides the generation of the object set representations, another key element is the application of standard 
data conversation methods. These methods transform the attribute domains of real values into categories 
using a discretization technique. 

— Application of statistical attribute reduction methods. In this case, the dominating approach calculates the 
correlation between the attributes to determine the strong relationships. Regarding the popular methods, 
we can mention the PCA and SVD algorithms. The preprocessing step's main benefit is that it yields a 
smaller dataset requiring fewer costs during the computations. 

B. Pillar 2 (Domain expert) 

The approach can be enhanced by using domain experts (prior knowledge) about the domain problem 

to lead the reduction process. In this context many works integrated the prior knowledge for better reduction such 
s [17], [70]. The JBOS approach [15] is used in the clustering domain based on prior knowledge. It aims to 
substitute a set of similar objects with relevant objects based on a qualitative evaluation of their features. Other 
methods, like [52], and [72], suggest restricting features using prior knowledge. These restrictions are known as 

"attribute-dependency rules." Only formal concepts that adhere to the restrictions are kept when constructing 

formal concepts and the concept lattice. In this study, the analysis and comparison of the reduction methods for 

each category under the concept lattice of this pillar as shown in Figure 4 revealed the following features: 
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— For the most popular reduction methods in the literature that we selected in this study under the category 
of context pre-processing, no domain expert (prior knowledge) has been used to assist the reduction. 

— Only five strategies [32], [59], [83], [84], [86] in the non-essential distinction’s elimination category 

involve a domain expert to lead the reduction process. 
— Only five strategies [20], [61], [82], [87], [88] in the concept filtration category involve a domain expert 
to lead the reduction process, and twelve do not. 


Li & Wang, 2016 
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Figure 4. Concept lattice of Pillar 2 


C. Pillar 3 (Changing the original data structure) 

A first feasible reduction is to replace a group of objects with precisely identical attributes with a single 
object, or a group of attributes that appear in identical objects with a single attribute. A clarified formal context 
has been acquired after such redundancies have been removed. More precisely, the formal context depicted in 
Table 1 demonstrates that the objects 0,,0; € G such that 0; = og, then 0,,0;can be reduced to a single 
representative object. Similarly, the attributes a,,a, € M such that a, = ag, then a,, a, can be reduced to a single 
representative attribute [2]. A variety of methods have been founded with the aim of reducing formal contexts. JJ 
Qi [52] the authors set guidelines for reducing the number of discernibility computations while maintaining the 
potential for getting a minimal set of attributes. Zhang ef al. [51] determine if a formal context's attributes are 
“absolutely necessary”, “relatively necessary”, or “absolutely unnecessary”. If an attribute appears in all minimal 
sets, it is considered “absolutely necessary”’. If it appears in at least one, but not all, minimal sets, it is considered 
“relatively necessary”. Lastly, if an attribute is not included in any minimal set, it is “absolutely unnecessary”. Based 
on the literature review, the dominant approaches' relationships can be presented as a concept lattice as shown in 
Figure 5. The performed analysis of the reduction methods in the literature highlights the following key features: 

— In the pre-processing context category, all reduction methods alter the set of attributes and the set of 
occurrences (incidence relation), except one method [2] that also changes the set of objects. 

— Inthe non-essential distinctions elimination category, all methods modify occurrences (incidences); as well 
as another two methods [32], [83] modify the set of objects, five methods [59], [60], [84]—[86] alter the set 
of attributes too, and one [33] alters both the set of objects and the set of attributes along with incidences. 

— Inthe concept filtration category, all reduction methods alter the set of attributes, set of objects as well as 
the set of occurrences (incidence relation), except two methods, one of them [71] doesn’t alter the set of 
attributes and the other one [36] alters the set of incidences only. 

D. Pillar 4 (Final concept lattice) 

Regarding the structure of the generated reduced concept sets, we can distinguish the following main 
approaches: a) only concept set without lattice; b) concept hierarchy, c) not-isomorphic concept lattice, d) 
isomorphic sub concept lattice or e) not-isomorphic concept lattice. In Figure 6, these variants are depicted 
pillar 4. The analysis and comparison of the reduction methods for each category under the concept lattice of 
this pillar revealed the following features: 
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— In this case, we get the list of the selected concepts without the ordering relationship among them. The 
main benefit of this approach is the simplicity; the performed calculations require fewer time costs. On 
the other hand, the result can not be used to show the specialization relationship. 

— The hierarchy structure means a simplification of the original lattice structure. This structure is very popular 
in many fields of knowledge engineering and software engineering; it can be used for additional processing 
steps. We can mention the single parent class approach in UML or the taxonomy of some ontology. 

— Under the non-essential distinction’s elimination category, there are no methods for constructing an 
isomorphic final lattice or a subset of the original concept lattice. All of them have a different final lattice 
than the original. In this case, only some selected concepts remain in the goal lattice, and this method 
results in a different ordering relationship. Another approach is when the output reduced lattice consists 
of blocks of the input lattice, and the edges correspond to the ordering relationship among these blocks. 

— In the concept filtration category, all methods generate a final lattice that is a part of the original lattice, 
with no resultant lattice being isomorphic to the original but only one [71] being different. 

— In this case, the input and output concepts are isomorphic, usually having a different attribute set. The 
main goal is to keep only those attributes representing the existing relationship in the input space. 
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Figure 5. Concept lattice of Pillar 3 


E. Pillar 5 (Quality of reduction) 

During the process of reduction, it is possible that some of the information that is being represented by a 
concept lattice will be lost. It is important that such a loss be quantified in an objective manner using indicators that 
reflect the ability of the concept lattice and that enable comparisons to be made between the various techniques. A 
concept lattice's complexity is typically evaluated using a variety of different metrics, including the cardinality of the 
covering relation, the number of formal concepts, objects, or occurrences, and the attribute cardinality. Other 
reduction methods utilize particular metrics for selecting formal concepts in the original lattice that represent the 
quality of the final lattice from the standpoint of the specific methodology. Indexes depending on stability [69], 
frequency [35], or distance [79] are only a few examples. Such metrics may not be applicable throughout all 
situations. Furthermore, such metrics exclude losses linked to structural characteristics of the lattice, which can be 
significant. Figure 7, depicted the concept lattice for pillar 5 derived from Table 3. The analysis and comparison of 
the reduction methods for each category under the concept lattice of this pillar revealed the following features: 

— For the analysis and comparison of the reduction methods in this work we will consider two measures to 
quantify the degree of comparability between both the original formal context and the reduced formal 
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context: a) fidelity loss and b) descriptive loss; fidelity loss is determined from a collection of implications 
derived from the concept lattice, which measures the level of consistency of the information available in 
the generated formal context concerning the original. The descriptive loss is calculated directly on the 
acquired formal context and assesses the loss of capacity to express the features provided in the model 
given in the original formal context [23]. Fidelity is a valuable metric for assessing non-essential 
elimination methods and concept filtration methods. Descriptive loss is beneficial for all reduction 
methods but requires formal context accessibility. In the pre-processing context category, we can mention 
that no method has fidelity loss, but they all have a descriptive loss. 

— Under the non-essential distinction’s elimination category, all reduction methods have fidelity loss and 
descriptive loss, except one method [59] which has only descriptive loss. 

— All reduction methods in the concept filtration category have both fidelity loss and descriptive loss. 
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Figure 6. Concept lattice of Pillar 4 
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Figure 7. Concept lattice of Pillar 5 
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F. Pillar 6 (Category of reduction method) 

Figure 8, shows the concept lattice for each category of reduction methods that were derived from 
Table 3: Figure 8(a), indicates the concept lattice for context pre-processing methods, Figure 8(b) indicates the 
concept lattice for non-essential distinctions elimination methods, and Figure 8(c) indicates the concept lattice 
for concept filtration methods. The ten methods in the pre-processing context category are distinguished 
because most methods begin the reduction process with a set of formal concepts and do not employ previous 
information. All reduction methods provide an isomorphic final lattice to the original. Most of the seventeen 
methods in the category of concept filtration modify the set of objects, attributes, and occurrences and yield a 
final lattice that is a part of the original lattice. In a nutshell, the majority of techniques for non-essential 
distinctions elimination methods begin the process of reduction with the formal context, which ultimately leads 
to a concept lattice that is not isomorphic and is not a subset of the initial concept lattice. Some are using prior 
information to aid in the reduction and elimination process. 
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Figure 8. Concept lattice for each category of the reduction methods (a) context pre-processing methods, 
(b) non-essential distinctions elimination methods, and (c) concept filtration methods 
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Context pre-processing methods yield a final lattice with fewer attributes, ideal for applications that 
need direct interaction with the user, either via a basic analysis and visualization of the lattice or exploitation 
of the new formal context's implications. Non-essential distinctions elimination methods are often used in the 
formal context and have a level of complexity that allows for controlling extremely vast formal contexts. The 
final concept lattice may diverge significantly from the original and have an inadequate quality. Concept 
filtration methods operate by reducing the space of concepts based on some relevance criterion, such as using 
an objective function on the concept lattice to eliminate pathways that appear to be irrelevant. The building of 
iceberg concept lattices, introduced by [35], is a highly respected approach in this category of methods. The 
downside of this method seems to be that significant formal concepts may be ignored over the process. Some 
techniques list all formal concepts first, then use a criterion to choose the ones that are relevant [20], [61], [69]. 
A strategy that focuses on this method can be costly because the whole search space is investigated. 


5. CONCLUSION AND FUTURE WORK 

FCA is an appropriate tool in many application areas for constructing concept (class) hierarchies or 
lattices to provide an efficient knowledge representation. One promising area is the field of software engineering, 
where concept lattices can be used to build standard UML diagrams. In this case, the goal of the reduction module 
is to select the relevant key classes for model generation needed for UML modeling. The proposed engine will 
extract the context from existing documents, and the generated FCA lattice will be converted into a UML model. 
Another important application area is automated educational systems, where the knowledge base covers content 
ontology and the student's competency maps. One way to generate the knowledge units in the content ontology is 
the application of FCA tools with concept reduction to determine only the important elements in the ontology 
model. FCA can also be used as a general method for clustering, too. In this case, every concept in the lattice 
corresponds to a cluster of the covered atomic items. The reduction here is a powerful step to determine only the 
important clusters in the hierarchy. A special case of clustering is the field where items are represented with 
sequences like genes in bioinformatic or event sequences in business process mining. Another important FCA 
application area is association rule mining; the goal of association rule mining, a subfield of data mining, is to 
find intriguing common patterns, correlations, or associations in a dataset. Various issues, including the extraction 
of redundant rules, the enormous volume of extracted rules, and the potential loss of significant rules, make it 
difficult to extract traditional association rules from large amounts of data. Integrating formal concept analysis 
into the mining process becomes one of the most promising applications to solve these issues. Only the frequently 
closed item sets may be directly derived when FCA is used in the association rule mining (ARM) process. 
Utilizing FCA to extract association rules (ARs) is regarded as one of the lattice-based techniques (LBT) that 
works better than the related traditional mining techniques compared to the extraction time. 

Additionally, without reprocessing the raw data, ARs may be derived with various pairings of minimal 
support and confidence using the resulting formal concept lattice. As a result, database searches might be 
considerably reduced. Therefore, it has been noted that a formal concept lattice is a crucial tool for obtaining 
the association rules basis, which has been hypothetically demonstrated to be a minimal, efficient, and 
nonredundant association rule set. Additionally, association rule mining with formal concept analysis may be 
utilized to categorize data efficiently. Concept lattice reduction techniques are useful for reducing the 
complexity of the concept lattice and improving its readability and usability in various application areas. The 
choice of the appropriate reduction technique depends on the specific goals and requirements of the application 
and the trade-off between the quality of the resulting concept lattice and the computational complexity of the 
technique. In this work, we suggest categorizing concept lattice reduction methods into three main categories: 
First, context pre-processing methods that try to identify the smallest number of objects\attributes that will 
preserve the original lattice's structure. Non-essential distinctions elimination methods are the second category. 
They aim to keep things as simple as possible while highlighting the most important features. Finally, concept 
filtration methods aim to choose formal concepts, objects, or attributes based on a relevance criterion. Each of 
these categories has its own strengths and limitations, and the appropriate technique will depend on the specific 
needs and goals of the application. Overall, it was determined that context pre-processing methods, which 
remove redundant information and create a concept lattice with fewer attributes, may be effective in situations 
where user interaction is important. Non-essential distinctions elimination methods, which simplify the concept 
lattice by reducing the space of concepts, may be more suitable for cases where the size of the concept lattice 
is more important than its quality. Concept filtration methods, which select relevant concepts based on a 
criterion, can be computationally expensive if they require enumerating all formal concepts before applying 
the selection process. 

Thirty-eight methods were chosen from among the most important methods in the literature. The 
methods have been compared, analyzed, and categorized using six major pillars: a preliminary step of the 
reduction process, domain experts, changing the original data structure, final concept lattice, quality of 
reduction, and category of reduction method. The analysis was carried out using FCA, where the selected 
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methods are summarized in a formal context. It was demonstrated that all reduction methods modify the 
occurrence (incidences) and result in some level of descriptive loss. Considering each category of methods, 
most reduction methods under the context of pre-processing start the reduction process from the set of formal 
concepts and modify the set of attributes. They don’t use prior information (domain expert), resulting in a final 
isomorphic lattice to the original. While most methods for eliminating non-essential distinctions use formal 
context as a preliminary step in the reduction process, and some involve a domain expert to lead the reduction 
process, where there are no methods for constructing an isomorphic final lattice or a subset of the original 
concept lattice, all of them have a different final lattice than the original. Considering the methods of concept 
filtration, most have the formal context as a preliminary step for the reduction process. Most don’t use domain 
experts, resulting in a final lattice that is partial to the original lattice. Furthermore, all reduction methods in 
this category have both fidelity loss and descriptive loss. 

The most significant reductions are those that can significantly reduce the amount of space to be 
examined. Such reductions are acquired through "non-essential distinctions elimination" and "concept 
filtration" methods. Methods for "non-essential distinctions elimination" may be regarded as "risky" because 
they have the potential to alter the set of formal concepts dramatically. It's important to ensure that the process 
that leads to these changes keeps the concept lattice's core components. Although concept filtration methods 
are intriguing because they reduce the concept space, traversing such a place should be done in such a way that 
the important concepts are reached. Our future work will move in two directions: first, we will categorize the 
reduction techniques for FCA extensions such as fuzzy formal context, fuzzy concept lattice, rough set theory, 
and decision formal context, based on our categorization for the reduction methods of classical FCA in this 
work. Second, based on the findings of the reduction methods analyzed in this study, we will choose the 
methods that appear to be the best for making some modifications and improvements. 
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